Sample size determination or estimation is the act of choosing the number of observations or replicates to include in a statistical sample. The sample size is an important feature of any empirical study in which the goal is to make inferences about a population from a sample. In practice, the sample size used in a study is usually determined based on the cost, time, or convenience of collecting the data, and the need for it to offer sufficient statistical power. In complex studies, different sample sizes may be allocated, such as in stratified surveys or experimental designs with multiple treatment groups. In a census, data is sought for an entire population, hence the intended sample size is equal to the population. In experimental design, where a study may be divided into different , there may be different sample sizes for each group.
Sample sizes may be chosen in several ways:
Consider the case where we are conducting a survey to determine the average satisfaction level of customers regarding a new product. To determine an appropriate sample size, we need to consider factors such as the desired level of confidence, margin of error, and variability in the responses. We might decide that we want a 95% confidence level, meaning we are 95% confident that the true average satisfaction level falls within the calculated range. We also decide on a margin of error, of ±3%, which indicates the acceptable range of difference between our sample estimate and the true population parameter. Additionally, we may have some idea of the expected variability in satisfaction levels based on previous data or assumptions.
In some situations, the increase in precision for larger sample sizes is minimal, or even non-existent. This can result from the presence of or strong dependence in the data, or if the data follows a heavy-tailed distribution, or because the data is strongly dependent or biased.
Sample sizes may be evaluated by the quality of the resulting estimates, as follows. It is usually determined on the basis of the cost, time or convenience of data collection and the need for sufficient statistical power. For example, if a proportion is being estimated, one may wish to have the 95% confidence interval be less than 0.06 units wide. Alternatively, sample size may be assessed based on the power of a hypothesis test. For example, if we are comparing the support for a certain political candidate among women with the support for that candidate among men, we may wish to have 80% power to detect a difference in the support levels of 0.04 units.
The estimator of a proportion is , where X is the number of 'positive' instances (e.g., the number of people out of the n sampled people who are at least 65 years old). When the observations are independent, this estimator has a (scaled) binomial distribution (and is also the sample arithmetic mean of data from a Bernoulli distribution). The maximum variance of this distribution is 0.25, which occurs when the true parameter is p = 0.5. In practical applications, where the true parameter p is unknown, the maximum variance is often employed for sample size assessments. If a reasonable estimate for p is known the quantity may be used in place of 0.25.
As the sample size n grows sufficiently large, the distribution of will be closely approximated by a normal distribution.NIST/SEMATECH, "7.2.4.2. Sample sizes required", e-Handbook of Statistical Methods. Using this and the Wald method for the binomial distribution, yields a confidence interval, with Z representing the standard Z-score for the desired confidence level (e.g., 1.96 for a 95% confidence interval), in the form:
To determine an appropriate sample size n for estimating proportions, the equation below can be solved, where W represents the desired width of the confidence interval. The resulting sample size formula, is often applied with a conservative estimate of p (e.g., 0.5):
for n, yielding the sample size , in the case of using 0.5 as the most conservative estimate of the proportion. (Note: W/2 = margin of error.)
In the figure below one can observe how sample sizes for binomial proportions change given different confidence levels and margins of error.
Otherwise, the formula would be , which yields . For example, in estimating the proportion of the U.S. population supporting a presidential candidate with a 95% confidence interval width of 2 percentage points (0.02), a sample size of (1.96)2/ (0.022) = 9604 is required with the margin of error in this case is 1 percentage point. It is reasonable to use the 0.5 estimate for p in this case because the presidential races are often close to 50/50, and it is also prudent to use a conservative estimate. The margin of error in this case is 1 percentage point (half of 0.02).
In practice, the formula : is commonly used to form a 95% confidence interval for the true proportion. The equation can be solved for n, providing a minimum sample size needed to meet the desired margin of error W. The foregoing is commonly simplified: "Confidence Interval for a Proportion" n = 4/ W2 = 1/ B2 where B is the error bound on the estimate, i.e., the estimate is usually given as within ± B. For B = 10% one requires n = 100, for B = 5% one needs n = 400, for B = 3% the requirement approximates to n = 1000, while for B = 1% a sample size of n = 10000 is required. These numbers are quoted often in news reports of and other . However, the results reported may not be the exact value as numbers are preferably rounded up. Knowing that the value of the n is the minimum number of Elementary event needed to acquire the desired result, the number of respondents then must lie on or above the minimum.
In a precisely mathematical way, when estimating the population mean using an independent and identically distributed (iid) sample of size n, where each data value has variance σ2, the standard error of the sample mean is:
This expression describes quantitatively how the estimate becomes more precise as the sample size increases. Using the central limit theorem to justify approximating the sample mean with a normal distribution yields a confidence interval of the form
To determine the sample size n required for a confidence interval of width W, with W/2 as the margin of error on each side of the sample mean, the equation
.
For instance, if estimating the effect of a drug on blood pressure with a 95% confidence interval that is six units wide, and the known standard deviation of blood pressure in the population is 15, the required sample size would be , which would be rounded up to 97, since sample sizes must be integers and must meet or exceed the calculated minimum value. Understanding these calculations is essential for researchers designing studies to accurately estimate population means within a desired level of confidence.
All the parameters in the equation are in fact the degrees of freedom of the number of their concepts, and hence, their numbers are subtracted by 1 before insertion into the equation.
The equation is:
For example, if a study using laboratory animals is planned with four treatment groups ( T=3), with eight animals per group, making 32 animals total ( N=31), without any further stratification ( B=0), then E would equal 28, which is above the cutoff of 20, indicating that sample size may be a bit too large, and six animals per group might be more appropriate. Isogenic.info > Resource equation by Michael FW Festing. Updated Sept. 2006
and an alternative hypothesis:
for some 'smallest significant difference' μ* > 0. This is the smallest value for which we care about observing a difference. Now, for (1) to reject H0 with a probability of at least 1 − β when
Ha is true (i.e. a power of 1 − β), and (2) reject H0 with probability α when H0 is true, the following is necessary:
If z α is the upper α percentage point of the standard normal distribution, then
and so
is a decision rule which satisfies (2). (This is a 1-tailed test.) In such a scenario, achieving this with a probability of at least 1−β when the alternative hypothesis Ha is true becomes imperative. Here, the sample average originates from a Normal distribution with a mean of μ*. Thus, the requirement is expressed as:
Through careful manipulation, this can be shown (see Statistical power Example) to happen when
where is the normal cumulative distribution function.
There are many reasons to use stratified sampling:Kish (1965, Section 3.1) to decrease variances of sample estimates, to use partly non-random methods, or to study strata individually. A useful, partly non-random method would be to sample individuals where easily accessible, but, where not, sample clusters to save travel costs.Kish (1965), p. 148.
In general, for H strata, a weighted sample mean is
The weights, , frequently, but not always, represent the proportions of the population elements in the strata, and . For a fixed sample size, that is ,
which can be made a minimum if the sampling rate within each stratum is made
proportional to the standard deviation within each stratum: , where and is a constant such that .
An "optimum allocation" is reached when the sampling rates within the strata
are made directly proportional to the standard deviations within the strata
and inversely proportional to the square root of the sampling cost per element
within the strata, :
where is a constant such that , or, more generally, when
Unlike quantitative research, qualitative studies face a scarcity of reliable guidance regarding sample size estimation prior to beginning the research.
Imagine conducting in-depth interviews with cancer survivors, qualitative researchers may use data saturation to determine the appropriate sample size. If, over a number of interviews, no fresh themes or insights show up, saturation has been reached and more interviews might not add much to our knowledge of the survivor's experience. Thus, rather than following a preset statistical formula, the concept of attaining saturation serves as a dynamic guide for determining sample size in qualitative research. There is a paucity of reliable guidance on estimating sample sizes before starting the research, with a range of suggestions given.Emmel, N. (2013). Sampling and choosing cases in qualitative research: A realist approach. London: Sage. In an effort to introduce some structure to the sample size determination process in qualitative research, a tool analogous to quantitative power calculations has been proposed. This tool, based on the negative binomial distribution, is particularly tailored for thematic analysis.Galvin R (2015). How many interviews are enough? Do qualitative interviews in building energy consumption research produce reliable knowledge? Journal of Building Engineering, 1:2–12.
where:
Cumulative distribution function
Stratified sample size
with
Qualitative research
See also
General references
Further reading
External links
|
|